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ABSTRACT 

This  work  provides  analytical  results  on  the  canonical  cor¬ 
relation  analysis  (CCA)  of  data  sets  from  two  spatially  sepa¬ 
rated  arrays  of  sensors.  Our  case  studies  cover  both  single 
source  and  multiple  source  signals  in  either  white  or  colored 
noise  fields  for  array  signal  processing.  We  derive  analyti¬ 
cal  expressions  of  the  canonical  correlation  for  these  exam¬ 
ples  and  present  a  computer  simulation  analysis  of  empirical 
canonical  correlations  as  a  function  of  nominal  correlation, 
signal-to-noise  ratio  (SNR),  and  sample  support.  Results  ob¬ 
tained  reveal  an  interesting  fact  that  the  canonical  coefficients 
from  CCA  provide  reliable  information  on  the  spatial  corre¬ 
lation  existing  among  data  sets  from  two  arrays  only  when 
the  SNRs  at  both  arrays  are  reasonably  high.  When  sample 
correlation  matrices  (SCM)  are  used  in  the  empirical  CCA,  re¬ 
liable  correlation  can  be  estimated  from  CCA  asymptotically 
(either  at  high  SNRs  from  both  arrays,  or  with  a  large  number 
of  snapshots  in  comparison  with  array  dimensionality). 

Index  Terms —  canonical  correlation  analysis  (CCA),  ar¬ 
ray  signal  processing, 


for  analyzing  second-order  filtering  and  communication  sys¬ 
tems  over  the  Gaussian  channel  [3,  4].  They  multiplicatively 
decompose  concentration  ellipses  for  second-order  filtering 
and  additively  decompose  information  rate  for  the  Gaussian 
channel.  For  array  applications,  the  asymptotical  property 
of  CCA  has  been  exploited  by  [5,  6]  to  design  effective  so¬ 
lutions  to  the  estimation  of  sources’  directions  of  arrival,  to 
the  determination  of  model  order,  and  to  the  performance  im¬ 
provement  of  existing  subspace  DOA  estimation  methods.  In 
this  work,  we  focus  on  the  application  of  CCA  in  array  signal 
processing  with  emphasis  on  the  impact  of  sources’  SNRs  at 
different  receiving  arrays.  From  several  simplified  but  prop¬ 
erly  formulated  nominal  examples,  we  derive  the  analytical 
expressions  for  the  canonical  coefficients  of  one  and  two  sig¬ 
nal  sources  embedded  in  white  and  colored  noise,  and  analyze 
the  performance  of  empirical  canonical  correlations  as  a  func¬ 
tion  of  nominal  correlation,  SNR,  and  sample  support. 

2.  SINGLE  SOURCE  TWO  ARRAYS 

Given  two  sets  of  data  vectors  in  the  forms  of. 


1.  INTRODUCTION 

Canonical  correlation  analysis  (CCA)  [1,  2]  is  a  standard 
statistical  tool  for  analyzing  and  exploring  the  correlations 
among  two  multivariate  data  measurements.  Our  particular 
interest  in  studying  the  CCA  for  array  applications  is  driven 
by  the  need  to  determine  the  signal  spatial  coherence  across 
two  spatially  separated  arrays  (with  inherent  non-stationarity) 
and  subsequent  coherent  signal  processing  for  source  local¬ 
ization  and  beamforming.  Therefore,  we  put  special  emphasis 
on  the  impact  of  data  quality,  in  terms  of  SNR,  on  the  corre¬ 
lation  revealing  nature  of  the  CCA.  The  non-stationary  nature 
of  our  data  motivates  us  to  study  the  reliability  aspect  of  the 
empirical  CCA  in  comparison  with  the  true  statistic-based 
CCA  when  only  a  finite  amount  of  data  is  available. 

The  CCA,  which  measures  the  cosines  of  principal  angles 
between  two  random  vectors,  has  been  successfully  applied 
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x(i)  =  s  xax(t)  +  n  x(t) 
y(t)  =  syay(t)  +  ny{tY 


t  =  1,2, 


(1) 


At  a  fixed  time  index,  the  data  vectors  x(i)  and  y (t),  of  di¬ 
mensions  Nx  x  1  and  Ny  x  1  respectively,  can  be  treated  as 
snapshots  from  two  spatially  separated  arrays.  We  further  as¬ 
sume  that  they  are  zero-mean  complex  Gaussian  distributed 
with  auto/cross-correlation  matrices. 


kxi  =  E{x(t)xH  (t)}  =  a2x  sxsx  +  Rna>n<1! 

Ryy  =  E{y{t)yH{t)}  =  a^SyS^  +  Rnyrly 
R  xy  =  E{x(t)yH(t)}  =  paxaysxsy  +  R„x„y. 

Here,  parameter  p  =  E{ax(t)a* (t)}  /  axay  denotes  the  spa¬ 
tial  correlation  coefficient  between  two  Gaussian  processes 
ax(t)  and  ay(t),  with  zero  means  and  prescribed  correla¬ 
tion.  When  two  arrays’  spatial  separation  distance  is  greater 
than  the  spatial  coherence  length  of  the  noise  field,  we  can 
safely  assume  R„xny  =  0.  The  augmented  Gaussian  vector 


978-l-4244-2354-5/09/$25.00  ©2009  IEEE 


2113 


ICASSP  2009 


Report  Documentation  Page 


Form  Approved 
OMB  No.  0704-0188 


Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 
VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 


1.  REPORT  DATE 

MAY  2010 


2.  REPORT  TYPE 

N/A 


3.  DATES  COVERED 


4.  TITLE  AND  SUBTITLE 

Does  Canonical  Correlation  Analysis  Provide  Reliable  Information  On 
Data  Correlation  In  Array  Processing? 

6.  AUTHOR(S) 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Dept,  of  ECE,  New  Jersey  Institute  of  Technology,  Newark,  NJ  07102, 
USA 


5a.  CONTRACT  NUMBER 


5b.  GRANT  NUMBER 


5c.  PROGRAM  ELEMENT  NUMBER 


5d.  PROJECT  NUMBER 


5e.  TASK  NUMBER 


5f.  WORK  UNIT  NUMBER 


8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


9.  SPONSORING/MONITORING  AGENCY  NAME(S )  AND  ADDRESS(ES )  10.  SPONSOR/MONITOR' S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release,  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

See  also  ADM002314.  IEEE  International  Conference  on  Acoustics,  Speech  and  Signal  Processing  (34th) 
held  in  Taipei,  Taiwan  on  19-24  April  2009.  U.S.  Government  or  Federal  Purpose  Rights  License,  The 
original  document  contains  color  images. 

14.  ABSTRACT 

This  work  provides  analytical  results  on  the  canonical  correlation  analysis  (CCA)  of  data  sets  from  two 
spatially  separated  arrays  of  sensors.  Our  case  studies  cover  both  single  source  and  multiple  source  signals 
in  either  white  or  colored  noise  fields  for  array  signal  processing.  We  derive  analytical  expressions  of  the 
canonical  correlation  for  these  examples  and  present  a  computer  simulation  analysis  of  empirical  canonical 
correlations  as  a  function  of  nominal  correlation,  signal-to-noise  ratio  (SNR),  and  sample  support.  Results 
obtained  reveal  an  interesting  fact  that  the  canonical  coefficients  from  CCA  provide  reliable  information 
on  the  spatial  correlation  existing  among  data  sets  from  two  arrays  only  when  the  SNRs  at  both  arrays  are 
reasonably  high.  When  sample  correlation  matrices  (SCM)  are  used  in  the  empirical  CCA,  reliable 
correlation  can  be  estimated  from  CCA  asymptotically  (either  at  high  SNRs  from  both  arrays,  or  with  a 
large  number  of  snapshots  in  comparison  with  array  dimensionality). 


15.  SUBJECT  TERMS 


16.  SECURITY  CLASSIFICATION  OF: 


a.  REPORT 

unclassified 


b.  ABSTRACT 

unclassified 


c.  THIS  PAGE 

unclassified 


17.  LIMITATION  OF 

18.  NUMBER 

ABSTRACT 

OF  PAGES 

SAR 

4 

19a.  NAME  OF 
RESPONSIBLE  PERSON 


Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


z(t)  =  [x7  (t)  yT(t)]T  has  a  correlation  matrix, 

1  V)\T  11; 


xy 


1 1 


=  R, 


R„ 


with  Rss  = 
and  Rnrl  = 


0 

Si/ 


yy 

2 

X 

p*  O' x  cr 


&x  P®xGy 

2 


0 

0 

Sy 

l  H 


nxnx 

0 


R 


‘TlyTly 


Using  a  square-root  decomposition  of  a  Hermitian  matrix, 
R„r  =  Rx^Rfi2  and  R“x  =  Rif /2Rix1/2,  we  can  find 
the  analytical  expressions  for  the  canonical  correlation  coeffi¬ 
cients,  given  the  data  model  in  eq.  (1).  In  principle,  the  CCA 
involves  two  stages  of  data  transformation.  The  first  stage  ac¬ 
complishes  a  task  of  whitening  each  vector-component  of  the 
augmented  vector  z (t).  That  is,  the  block-whitened  data  £(i) 
defined  by. 


C(i)  = 


fl(t) 

Rxx1/2  0 

v(t) 

0 

fs 

S', 

to 

z  (t), 


a  block-whitening  operator 

has  correlation  matrix  of  the  following  form. 


ncc  =  E{c(t)cH(t)}  = 


i 

cH 


c 
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Here  C  =  Rxf  "RxyRj/y2  is  the  coherence  matrix.  The 
second  stage  in  the  CCA  diagonalizes  the  coherence  matrix  C 
via  orthogonal  matrix  transformation,  or  equivalently,  using 
a  transformation  to  produce  decoupled  canonical  coordinates. 
When  the  background  noise  is  white,  we  can  derive  the  eigen- 
structures  for  the  matrices  RXx  and  Ry?y.  Specifically, 

Nx 

2  H  2  v  jfij 

Rra  =  0'xSxSx  +  (7n^I  =  y  _  A 

*= 1 

with  its  eigen-values  and  eigen-vectors  given  by, 
the  principal  pair:  Ax, max  =  max{Ax,,}  =  er2||sx||2  +  of 


=  Vx.l  =  S, 


X  ||=>X||) 


the  minor  eigenpairs:  A X)i  =  <r2  (repeated  eigenvalues) 

-L  ®x:  i  =  2, 3, ... ,  Nx 


_ 1/2 

Hence,  the  spectral  decomposition  of  RXx  becomes, 

1  -P.  +  —  P7-  .  (2) 


R"1/2  = 


XX 


Here  the  matrices  PSx  =  sxs^/||s.x||2  and  Pf  =  I  -  PSx 
are  orthogonal  projection  operators  onto  the  signal  subspace 
(sx)  and  its  orthogonal  complement.  Similarly,  we  have, 

1  _  1 


R-1/2  = 
yy 


=PS„  +  — Pf.  (3) 

r2||c  112  I  „2  On  y 


Consequently,  the  analytical  expression  for  the  canonical 
coefficients  can  be  derived  from  the  singular  value  decompo¬ 
sition  (SVD)  of  the  coherence  matrix  C,  i.e., 

C  =  H  ,.,!  2K,„R,  ,f  2 


pcrx  Oy 
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=  SXSy.  (4) 


This  indicates  that  the  only  non-zero  canonical  correlation  co¬ 
efficient  is  given  by  the  principal  singular  value  of  C, 

H 1)  =  \p\ 


\/crxllsxl|2  +  ol 


V° 
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Py 


t  +  Vx  V  1  +  %  ’ 


(5) 


with  r]x  =  cr2||sx||2/t72^  and  py  =  o2y\\sy\\2 / oly  represent¬ 
ing  the  output  SNR  of  array-x  and  array-//,  respectively.  Be¬ 
sides  the  known  fact  that  the  number  of  sources  impinging  on 
the  arrays  is  indicated  by  the  number  of  non-zero  canonical 
correlation  coefficients  [2],  our  result  in  eq.  (5)  brings  up  the 
important  correlation  revealing  property  of  the  CCA.  That  is, 
only  when  reasonably  high  SNRs  are  observed  in  both  data 
sets,  i.e.  t)X7  r)y  1,  hence  fc(l)  — >  \p\,  can  the  canonical 
correlation  coefficients  truthfully  reveal  the  spatial  coherence 
existing  in  data  sets  from  two  spatially  separated  arrays. 

3.  EXTENSION  TO  COLORED  NOISE  FIELDS 

Quite  often,  colored  noise  fields  are  present  at  each  array  due 
to  the  compact  design  of  arrays.  In  such  applications,  the 
noise  correlation  matrices  R„:r  and  R„m  are  no  longer  diag¬ 
onal.  We  first  introduce  a  preliminary  transformation  matrix 
on  the  augmented  data  vector  z (t)  as  follows, 


z(i), 


'  v(t)  ' 

"  R nl'2  0 

1 fit) 

0  Rif2  

Kcc  =  E{C(t)CH(t)}  = 


a  noise  whitening  operator 

so  that  the  correlation  matrix  of  data  £(t)  becomes. 
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Rl2  R22 
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R12  =  R.^1/2  •  POxOy  Sxsf  •  R-f' 12  ■ 

We  then  adopt  the  standard  block-whitening  procedure  in  the 
CCA  to  the  vector-components  in  data  C(i),  resulting  in  the 
block- whitened  data  vector 
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_ 1/2  _ 

with  Rn  '  and  R22'  “  given  as, 


-1/2 


This  leads  to  the  detailed  spectral  decomposition. 


p>  — 1/2 _ 

Rn  — 


\[°l\ 


R 


-1/2  _ 


inl/2Sx  || 

1 


zPrJ/2sx+  Pr- 


1/2 


(6) 


\J*l  HR^IP  +  i 

This  results  in  a  correlation  matrix 

R«  =  E{zmH(t)}  = 


Pjj-1/2  +  Pr-1/2 

R"a  y  sy 


I  c 

CH  I 


(7) 


with  coherence  matrix 
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The  above  rank-1  coherence  matrix  again  leads  to  a  single 
non-zero  canonical  correlation  coefficient. 


k(l)  =  \p\ 


Vx 


Vy 


(9) 


with  r]x  =  ct“||R; 
2 
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1  +  Vx  \  1  +  Vy 

=  vWRr,‘s,.  and  vv  = 
cjy\\Rnl  2sy\\2  =  UySy~R.~^Sy  being  the  output  SNRs  at 
array-x  and  array-?/  in  colored  noise  fields,  respectively. 
Again,  at  high  SNRs,  r/x ,  Vy  1,  we  have  fc(l)  — >  \p\, 
meaning  that  the  canonical  correlation  coefficient  reveals 
the  spatial  coherence  (normalized)  among  two  sets  of  data 
vectors  from  two  spatially  separated  arrays. 


4.  EXTENSION  TO  TWO  SOURCES  TWO  ARRAYS 

When  two  signal  sources  are  present  in  the  data  sets  from  two 
arrays,  the  correlation  matrices  take  the  following  forms, 

2  H  2  H  2  t 
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Matrix  RIl;  can  be  represented  in  its  orthogonal  eigen-modes 
in  descending  order,  i.e., 

Nx 
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i=l 

where  the  first  two  principal  eigen  modes  span  the  signal  sub¬ 
space,  span(vXii, vx,2)  =  span(sXji, s^^)-  Under  the  com¬ 
monly  held  condition:  ||sXji||  =  || 1|  =  ||sx 1 1,  two  orthog¬ 
onal  vectors,  sXjS  =  sx>i  +  sXj2  and  sXjA  =  sXji  —  sXj2,  can 
be  constructed  from  the  two  signal  modes.  Therefore,  the  two 
principal  eigen-vectors  can  be  formed  as. 
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where  equality  =  holds  when  cr2  1  =  cr2  2  =  cr2.  Accordingly 
the  eigenvalues  of  Rxx  can  be  found  as, 
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Hence  the  spectral  decomposition  of  Rxx  2  and  R,yJ /2 
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where  the  above  identity  =  holds  when  p\  =  p2  =  p,  and. 
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Consequently,  the  coherence  matrix  C  can  be  decomposed  as. 
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Therefore,  the  non-zero  canonical  correlation  coefficients, 
fc(l)  and  k{ 2),  can  be  obtained  as. 
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with  r)x>1  =  ^l|sx,=  ||2/2^a!,  Vy,  1  =  al\\sy,s\\2/2aly, 
Vx,2  =  crx||sx,A||2/2o-^,  and  Vy  2  A  cr^ ||Sj/;A || 2/2ct^  being 
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the  equivalent  output  SNRs  of  two  orthogonal  decomposition 
of  signal  modes  of  array-.!  and  array-/;,  respectively.  Our 
findings  are  verified  once  again  that  at  reasonably  high  SNRs, 
we  have  k( 2)  — ►  k(  1)  — ►  \p\,  meaning  that  the  canonical 
correlation  coefficients  reveal  the  spatial  coherence  among 
two  sets  of  data  vectors  from  two  spatially  separated  arrays. 

5.  SIMULATIONS 

Simulation  results  are  presented  to  demonstrate  our  analyti¬ 
cal  results  on  CCA  in  comparison  with  the  empirical  CCA 
(with  varying  snapshot  numbers  and  SNRs).  We  choose  two 
uniform  linear  arrays  (ULAs)  with  Nx  =  Ny  =  8.  The  im¬ 
pinging  angles  of  the  sources  are  arbitrarily  chosen  and  fixed. 
As  shown  in  the  following  figures,  we  plot  the  values  of  the 
canonical  correlation  coefficients,  both  the  analytical  results 
in  solid  lines  in  eqs.(9)  and  (17),  and  the  simulated  results  in 
scattered  points  where  the  empirical  CCA  are  done  in  com¬ 
bination  with  subspace  pre-processing.  We  have  shown  k(  1) 
and  k( 2)  as  functions  of  the  nominal  spatial  correlation  p, 
under  various  SNRs  per  sample  for  both  single  source  in  col¬ 
ored  noise  fields  (Fig.l)  and  two  sources  in  white  noise  fields 
(Fig-2). 


Fig.  1.  CCA  for  a  single  source  in  colored  Gaussian  noise  field. 
Results  are  for  fc(l)  vs.  \p\  under  varying  SNRs  per  sample,  where 
(a)  16  and  (b)  80  snapshots  are  used  in  the  empirical  CCA. 

6.  CONCLUSIONS 

In  conclusion,  CCA  provide  reliable  information  about  spa¬ 
tial  correlations  existing  among  pairs  of  data  sets  only  when 
SNRs  at  both  arrays  are  reasonably  high,  and  the  sample  sup¬ 
port  is  significantly  larger  than  the  data  dimensions.  Specifi- 


!p| 


Fig.  2.  CCA  for  two  sources  in  white  Gaussian  noise  field.  Results 
are  for  fc(l)  and  k( 2)  vs.  \p\  under  varying  SNRs  per  sample,  where 
160  snapshots  are  used  in  the  empirical  CCA.  The  impinging  angles 
for  the  two  sources  are  15°  and  65°  respectively,  and  ||sa:is;||2/2  = 
8.60,  ||s;EiA||2/2  =  7.40,  ||sHjE||2/2  =  8.64,  and  ||s!/>A||2/2  = 
7.36. 

cally,  canonical  correlations  are  a  function  of  the  true  spatial 
correlations  as  well  as  the  SNRs  of  data  sets  from  both  ar¬ 
rays.  The  quality  of  empirical  CCA  depends  on  data  quality 
(SNRs)  on  both  arrays  and  sample  support. 
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